Picture for Hao Tang

Hao Tang

LLM-Guided Probabilistic Program Induction for POMDP Model Estimation

Add code
May 04, 2025
Viaarxiv icon

Multimodal Large Language Models for Medicine: A Comprehensive Survey

Add code
Apr 29, 2025
Viaarxiv icon

TTTFusion: A Test-Time Training-Based Strategy for Multimodal Medical Image Fusion in Surgical Robots

Add code
Apr 29, 2025
Viaarxiv icon

ShapeSpeak: Body Shape-Aware Textual Alignment for Visible-Infrared Person Re-Identification

Add code
Apr 25, 2025
Viaarxiv icon

Cabbage: A Differential Growth Framework for Open Surfaces

Add code
Apr 25, 2025
Viaarxiv icon

DMS-Net:Dual-Modal Multi-Scale Siamese Network for Binocular Fundus Image Classification

Add code
Apr 25, 2025
Viaarxiv icon

Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models

Add code
Apr 24, 2025
Viaarxiv icon

Multimodal Perception for Goal-oriented Navigation: A Survey

Add code
Apr 22, 2025
Viaarxiv icon

EventVAD: Training-Free Event-Aware Video Anomaly Detection

Add code
Apr 17, 2025
Viaarxiv icon

3D CoCa: Contrastive Learners are 3D Captioners

Add code
Apr 13, 2025
Viaarxiv icon